Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes

نویسندگان

چکیده

Studies have shown evolution strategies (ES) to be a promising approach for reinforcement learning (RL) with deep neural networks. However, the issue of high sample complexity persists in applications ES RL over long horizons. This paper is first address shortcoming todays methods via novel neuroevolutionary multitasking (NuEMT) algorithm, designed transfer information from set auxiliary tasks (of short episode length) target (full task at hand. The tasks, extracted target, allow an agent update and quickly evaluate policies on shorter time evolved skills are then transferred guide longer harder towards optimal policy. We demonstrate that NuEMT algorithm achieves data-efficient evolutionary RL, reducing expensive agent-environment interaction data requirements. Our key algorithmic contribution this setting introduce, time, multitask mechanism based statistical importance sampling technique. In addition, adaptive resource allocation strategy utilized assign computational resources their gleaned usefulness. Experiments range continuous control OpenAI Gym confirm our proposed efficient compared recent baselines.

منابع مشابه

Neuroevolution strategies for episodic reinforcement learning

Article history: Received 30 April 2009 Available online xxxx

متن کامل

Reinforcement Learning with Long Short-Term Memory

This paper presents reinforcement learning with a Long ShortTerm Memory recurrent neural network: RL-LSTM. Model-free RL-LSTM using Advantage( ) learning and directed exploration can solve non-Markovian tasks with long-term dependencies between relevant events. This is demonstrated in a T-maze task, as well as in a di cult variation of the pole balancing task.

متن کامل

Modular Multitask Reinforcement Learning with Policy Sketches

We describe a framework for multitask deep reinforcement learning guided by policy sketches. Sketches annotate each task with a sequence of named subtasks, providing high-level structural relationships among tasks, but not providing the detailed guidance required by previous work on learning policy abstractions for RL (e.g. intermediate rewards, subtask completion signals, or intrinsic motivati...

متن کامل

Distral: Robust multitask reinforcement learning

Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data efficiency is multitask learning with shared neural network parameters, where efficiency may be improved through transfer across related tasks. In practice, however, this is not usually observed, because gradients fro...

متن کامل

Multitask model-free reinforcement learning

Conventional model-free reinforcement learning algorithms are limited to performing only one task, such as navigating to a single goal location in a maze, or reaching one goal state in the Tower of Hanoi block manipulation problem. It has been thought that only model-based algorithms could perform goal-directed actions, optimally adapting to new reward structures in the environment. In this wor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Cognitive and Developmental Systems

سال: 2022

ISSN: ['2379-8920', '2379-8939']

DOI: https://doi.org/10.1109/tcds.2022.3221805